A Pilot Study of Enhancing Subject Discovery of Textual Web Resources

Author

  • Kwan Yi
Abstract

The aim of this study is to explore to what degree hyperlinked external resources contribute to automated subject-term indexing. The empirical evidence shows no performance gain from the additional resources. It also implies that target Web pages are closer in subject to citing pages than to cited pages.

Similar resources

Focussed crawling of environmental web resources: A pilot study on the combination of multimedia evidence

This work investigates the use of focussed crawling techniques for the discovery of environmental multimedia Web resources that provide air quality measurements and forecasts. Focussed crawlers automatically navigate the hyperlinked structure of the Web and select the hyperlinks to follow by estimating their relevance to a given topic, based on evidence obtained from the already downloaded page...

Expert Discovery: A web mining approach

Expert discovery is the search for an answer to the question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” An expert with domain knowledge in a field is crucial for consulting in industry, academia, and the scientific community. The aim of this study is to address the issues of the expert-finding task in a real-world community. Collabor...

Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels

The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...

DiscOU: A Flexible Discovery Engine for Open Educational Resources Using Semantic Indexing and Relationship Summaries

We demonstrate the DiscOU engine implementing a resource discovery approach where the textual components of open educational resources are automatically annotated with relevant entities (using a named entity recognition system), so that these rich annotations can be searched by similarity, based on existing resources of interest.

Overlap of respiratory system articles in the Scopus and Web of Science databases: a short report

Background: Because overlap in subject and content between databases leads to duplicate purchases and wasted resources, this study examined the degree of overlap between respiratory system papers indexed in Scopus and Web of Science from 2001 to 2010. Methods: In this survey study, researcher followed by obtaining percent overlap i...

Publication date: 2007